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A  Disturbance  Attenuation  Approach  To  Missile  Guidance  And  Control 

1.  Objective  of  the  Research  Effort 

Solutions  to  stochastic  control  problems  currently  do  not  produce  mechanizable  missile 
control  laws.  To  fill  the  gap  without  making  excessive  structural  assumptions,  it  is 
suggested  that  the  disturbance  attenuation  problem  be  extended  to  nonlinear  guidance  and 
control  problems.  This  deterministic  approach,  which  does  not  include  all  the  properties 
of  the  stochastic  control  solution,  does  have  characteristics  which  are  quite  appealing.  For 
example,  certain  classes  of  partial  information  disturbance  attenuation  problems  can  be 
solved  numerically  essentially  because  variation  methods  are  available.  Over  the  three 
year  period  of  the  grant,  robust  and  adaptive  guidance  and  control  laws  which  are 
mechanizable  with  near-future  computer  technology  are  developed  which  can  meet  system 
objectives  in  the  presence  of  large  uncertainties,  system  structural  changes,  and 
nonlinearities.  Of  particular  importance  emerging  from  our  focus  on  disturbance 
attenuation  is  a  new  structure  for  adaptive  control,  new  detection  filters  for  detection  and 
identifjting  structural  changes,  and  methodologies  for  including  system  nonlineanties. 
Fmally,  a  new  multiple-hypothesis  adaptive  estimator,  using  a  single  linear  filter  whose 
parameters  are  changing  according  to  the  on-line  computation  of  the  probability  of  each 
hypothesis  conditioned  of  the  residual  history,  is  shown  to  have  better  or  equivalent 
performance  to  the  current  bank  of  filters  concept  with  dramatic  decrease  in  computation 
and  ease  of  implementation. 

2.  Status  of  the  Research  Effort 

The  general  missile  guidance  problem  can  be  formulated  as  the  minimization  of  some 
statistical  function  of  the  terminal  miss  subject  to  a  nonlinear  stochastic  system  composed 
of  missile  and  target  dynamics,  a  noisy  nonlinear  measurement  of  the  state  space,  and  in¬ 
flight  constraints  on  the  system  structure  and  aerodynamics.  In  the  stochastic  control 
context,  the  terminal  miss  distance  is  described  by  a  probability  density  function.  The 
object  is  to  shape  this  density  function  to  give  the  desired  performance.  For  example,  the 
expected  value  of  the  miss  distance  could  be  the  performance  index.  However,  it  may  be 
desirable  to  allow  the  mean  miss  to  increase  somewhat  if  the  dispersion  about  the  mean 
can  be  decreased.  The  addition  of  an  intelligent  target  significantly  increases  the 
complexity  of  the  problem.  Solution  to  this  stochastic  control  problem  by  dynamic 
programming  is  not  currently  feasible.  Only  for  the  simplest  class  of  stochastic  control 
problems  are  solutions  available.  This  class  includes  the  linear-quadratic-Gaussian  (LQG) 
and  linear-exponential-Gaussian  (LEG)  problems.  In  Section  2.1,  for  the  discrete-time 
LEG  problem,  the  solution  of  this  problem  for  both  the  centralized  and  decentralized 
information  structures  are  shown  to  reduce  to  that  of  a  class  of  differential  games 
equivalent  to  that  used  to  solve  the  disturbance  attenuation  problem. 

Since  the  solution  to  the  stochastic  control  problem  is  currently  not  feasible,  our 
approach  to  robust  and  adaptive  guidance  and  control  is  to  consider  a  disturbance 
attenuation  formulation  which  produces  a  coherent  firamework  for  developing  and 
combining  guidance  laws  and  nonlinear  estimators  without  excessive  structural 
assumptions.  This  deterministic  approach  to  uncertainty  based  on  differential  game  theory 
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allows  certain  classes  of  partial  information  problems  to  be  solved  by  variational  methods 
which  are  characterized  by  ordinary  differential  equations  rather  that  by  the  partial 
differential  equation  of  dynamic  programming,  thereby  substantially  reducing  the 
numerical  complexity.  As  significant  amount  of  progress  has  been  made  in  understanding 
and  exploiting  the  relationship  between  minimax  and  saddle  point  strategies  of  these 
differential  games  for  particular  classes  of  nonlinear  dynamic  systems.  For  example,  in 
Section  2.2,  a  synthesis  technique  for  the  design  of  robust  flight  control  and  missile 
autopilots  in  the  presence  of  system  parameter  and  initial  conditions  uncertainties  was 
developed  via  a  disturbance  attenuation  approach.  Whereas  the  objective  of  robust 
controller  design  is  to  make  the  system  insensitive  to  the  parameter  uncertainty,  our  efforts 
also  focused  on  a  new  class  of  adaptive  control  techniques  in  which  the  value  of  the 
uncertain  parameters  are  to  be  estimated.  In  Section  2.3,  it  is  shown  that  the  structure  of 
this  new  adaptive  control  scheme  is  the  result  of  formulating  a  disturbance  attenuation 
problem  for  a  particular  class  of  nonlinear  systems  whose  solution  is  obtained  without  any 
approximation.  A  global  solution  is  obtained  and  must  be  contrasted  with  much  of  the 
nonlinear  results  which  assumes  that  the  scheme  operates  locally  about  some 
equilibrium  condition.  The  class  of  nonlinearities  considered  is  that  of  a  linear  system 
where  the  coefficient  matrix  of  the  control  is  assumed  to  be  a  linear  function  of  an 
unknown  parameter.  To  bring  these  mathematical  abstractions  to  engineering  practice,  a 
significant  effort  was  made  to  apply  this  new  adaptive  control  scheme  to  the  development 
of  an  adaptive  flight  control  system  for  a  high  angle-of-attach  aircraft  such  as  the  F-18 
HARV  (High  Angle-of-attach  Research  Vehicle). 

Since  target  motion  detection  is  important  for  miss  distance  reduction  in  missile 
guidance,  two  techniques  were  developed  and  discussed  in  Section  2.4.  The  first  is  based 
on  a  disturbance  attenuation  problem  in  order  to  achieve  a  degree  of  robustness.  In  the 
limit,  when  the  transmission  bound  is  brought  to  zero,  the  classical  detection  filter  is 
recovered.  In  the  second  approach  a  multiple  hypothesis  adaptive  estimator  for  tracking 
in  the  presence  of  target  motion  is  developed.  By  replacing  the  old  concept  of  using  a 
bank  of  filters  each  tuned  to  a  particular  hypothesis,  a  single  filter  is  used  where  the 
parameters  are  changed  according  to  the  on-line  computation  of  the  probability  of  each 
hypothesis  conditioned  on  the  residual  history. 

In  Section  2.5,  robust  missile  guidance  laws  with  bearing-only  measurements  based  on 
the  dissapative  inequality  are  shown  to  satisfy  a  disturbance  attenuation  bound  if  certain 
modifications  to  the  guidance  law  are  made  and  certain  conditions  are  satisfied. 
Furthermore,  in  a  different  approach,  over  a  finite-time  interval,  for  a  class  of  linear  time- 
varying  dynamical  systems,  an  approximate  nonlinear  optimal  estimator  and  control 
scheme  are  derived  based  on  an  expansion  technique  for  solving  the  disturbance 
attenuation  problem.  Finally,  in  Section  2.6  using  an  expansion  technique,  the  midcourse 
guidance  problem  for  tactical  and  ballistic  missile  intercepts  is  solved  where  velocity  on 
target  is  maximized  subject  to  miss  distance  and  orientation  constraints. 

In  the  following  subsections  the  work  on  this  grant  are  briefly  described.  The  papers 
[2,  6,  7,  9,  10,  13,  14,  15,  16,  17,  18]  represent  the  work  performed  on  this  grant.  Since 
[10]  is  in  the  early  stages  of  the  review  process,  and  because  of  its  importance,  it  is  given 
in  the  Appendix. 
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2.1  Some  Relationships  Between  Stochastic  Control  and  Disturbance  Attenuation 

For  the  discrete  time  LEG  problem,  where  the  dynamics  and  measurements  are  linear  with 
additive  Gaussian  uncertainty  and  the  performance  index  is  the  expected  value  of  the 
exponential  of  a  quadratic  function,  the  solution  of  this  problem  is  equivalent  to  that  of  a 
differential  game  [1,2].  Note  that  in  [2]  the  extension  of  the  LEG  problem  to  a  class  of 
decentralized  information  stractures  is  presented.  Similarly,  for  linear  systems  the 
disturbance  attenuation  problem  reduces  to  a  differential  game  [3].  The  {q)pealing  aspect 
of  disturbance  attenuation  problems  is  that  they  are  a  deterministic  methodology  to  handle 
stochastic  problems.  Therefore,  the  disturbance  attenuation  approach  is  being  extended  to 
classes  of  nonlinear  problems.  It  should  be  noted  that  the  deterministic  disturbance 
attenuation  formulation  can  be  obtained  from  a  stochastic  control  formulation  through 
large  deviation  theory  [4].  Although  many  important  properties  such  as  dual  control 
appear  lost,  the  disturbance  attenuation  approach  still  allows  a  unified  theory  for  the 
coherent  development  of  guidance  laws  assuming  partial  information.  The  functions  of 
guidance  and  estimation  may  be  integrated  together  without  gross  assumptions  such  as  the 
certainty  equivalence  principle,  although  aspects  of  this  concept  occur  naturally. 

2.2  Robust  Game  Theoretic  Synthesis  in  the  Presence  of  Uncertain  Initial  States 

Many  controller  design  problems  are  multicriteria  problems,  i.e.,  there  is  more  than  one 
performance  criterion  to  be  considered.  For  example,  in  the  bench  mark  problem  [5],  the 
controller  is  required  to  satisfy  certain  design  specifications  on  the  settling  time,  peak 
value  of  the  control  effort,  and  the  sensitivity  of  the  output  to  the  measurement  noise. 
Despite  a  significant  research  effort  into  the  reduction  of  the  sensitivity  of  the  output  to 
the  process  and  measurement  disturbances,  for  example,  in  or  linear-quadratic- 
Gaussian  (LQG)  control  theories,  improvement  of  transient  performance,  measured  as 
settling  time  and  peak  amplitude  of  output  due  to  nonzero  initial  states,  draws  little 
attention  in  control  theory  for  linear  time-invariant  (LTI)  systems.  The  usual  LTI  control 
problem  has  been  concerned  with  optimization  problems  with  a  performance  criterion  on 
the  sensitivity  of  output  to  the  input  process  and  measurement  disturbances,  with  the 
result  that  various  weighting  strategies  have  been  introduced  to  improve  transient 
performances  of  the  closed-loop  system.  However,  only  experienced  designers  have  a 
good  feel  for  the  relation  of  the  weighting  matrices  to  the  transient  behavior  of  the  closed- 
loop  system.- 

Strict  performance  requirements  on  the  controller  design  often  demand  an  exact  model 
of  the  system  to  be  controlled.  However,  uncertainties  are  usually  involved  in  the 
mathematical  description  of  physical  systems.  As  a  result,  a  closed-loop  system  in  the 
presence  of  system  uncertainty  often  has  poor  performance  or  even  becomes  unstable 
when  the  controllers  are  optimized  with  respect  to  a  nominal  model  of  the  system.  Hence, 
robustness  of  performance  with  respect  to  system  uncertainty  must  also  be  considered  in 
controller  design.  Many  physical  systems  have  real-valued  structured  uncertainty.  For 
these  systems,  robust  controller  design  based  upon  an  unstructured  uncertainty  model,  as 
in  the  control,  or  on  a  complex-valued  structured  uncertainty  model,  as  in  p.  -synthesis, 
may  produce  conservative  results.  These  observations  lead  to  the  introduction  of  a 
constrained  disturbance  attenuation  problem  (CDAP).  The  idea  in  this  problem  can  be 
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roughly  stated  as  follows:-  A  controller  is  to  be  found  that  minimized  the  worst-cast 
transmission  of  initial  state  to  the  output  out  of  a  set  of  controllers  that  provide  internal 
stability  and  achieve  a  prescribed  level  of  disturbance  attenuation  for  all  real  parameter 
uncertainties  in  a  given  set.  The  transmission  of  a  nonzero  initial  state  to  the  output  is 
closely  related  to  the  transient  response  of  the  system.  Hence,  transient  behavior  of  the 
control  system  as  well  as  attenuation  of  disturbances  are  directly  considered  in  CDAP,  in 
contrast  to  the  usual  disturbance  attenuation  problem  or  control  problem  in  which  CHily 
attenuation  of  disturbances  is  considered.  A  game  theoretic  approach  is  employed  for  the 
design  of  the  optimal  controller  to  solve  the  CDAP  [6].  Initial  state,  process  and 
measurement  disturbances,  and  unknown  system  parameters  are  considered  as  the 
maximizing  players,  whereas  the  control  is  treated  as  the  minimizing  player  in  a  minimax 
game  problem.  The  game  cost  criterion  consists  of  a  positive  term  composed  of  a 
quadratic  norm  on  the  system  output  and  negative  terms  composed  of  a  quadratic  norm  on 
the  disturbances.  Instead  of  being  penalized  in  the  game  cost  criterion,  initial  state  and 
uncertain  system  parameters  are  restricted  to  lie  on  or  within  the  prescribed  sets  (here  they 
are  multidimensional  ellipsoids).  In  contrast  to  the  results  in  [3],  the  cost  criterion  is 
nonseparable  and  this  formulation  produces  multiple  worst  initial  states  for  a  given  linear 
controller.  These  differences  make  it  difficult  to  use  the  simple  method  of  [3]  based  upon 
two  algebraic  Riccati  equations  in  solving  the  game  problem.  As  a  result,  a  linear  fixed- 
order  dynamic  compensator  based  upon  partial  information  is  assumed.  The  controller  is 
viewed  as  a  function  of  a  control  parameter  vector. 

2.3  An  Adaptive  Controller  Based  on  Disturbance  Attenuation 

To  understand  the  underlying  issue  of  generalizing  the  disturbance  attenuation  problem  to 
nonlinear  dynamical  systems,  we  have  examined  the  problem  of  linear  systems  with 
uncertain  parameters  in  the  control  coefficient  matrix  [7].  First,  the  minimax  problem  is 
reduced  to  a  one  sided  control  problem  whose  numerical  solution  for  real-time  control  is 
challenging.  To  simplify  the  numerical  difficulties  of  finding  a  nonlinear  feedback 
controller,  a  dynamic  programming  approach  is  used.  In  the  dynamic  programming 
approach,  the  control  problem  decomposes  into  a  sum  of  an  optimal  return  function 
representing  a  full  information  game  problem  and  an  optimal  accumulation  function 
representing  an  associated  estimation  problem.  The  worst  case  state,  found  by  maximizing 
the  sum  with  respect  to  the  current  state,  is  then  used  in  the  full  information  controller. 
Previous  results  required  that  the  worst  case  state  to  be  a  singleton  [8].  We  show  [7]  that 
the  existence  of  a  solution  to  the  minimax  problem  guarantees  a  unique  saddle  control  and 
the  validity  of  the  dynamic  programming  approach,  even  when  the  worst  case  state  is  not  a 
singleton.  Note  that  interesting  adaptive  control  problems  now  can  be  explicitly  solved 
producing  implemetable  controllers,  whereas  solutions  to  their  stochastic  counterpart  are 
still  not  available. 

An  alternate  and  more  direct  approach  is  given  in  [9]  where  the  minimax  problem  is 
reduced  to  an  equivalent  full  information  control  problem.  The  full  information  state 
space  dynamics  are  composed  of  an  estimator  equation  and  its  associated  Riccati  equation. 
The  equivalence  between  the  minimax  adaptive  controller  and  a  saddle  point  certainty 
equivalence  adaptive  controller  is  shown  via  Hamilton-Jacobi-Bellman  theory  where  the 
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optimal  return  function  is  differentiable.  Points  of  discontinuity  in  the  partial  derivatives  of 
the  optimal  return  function  are  shown  to  form  a  manifold  of  Darboux  points,  from  which 
multiple  global  optimal  trajectories  emanate.  Therefore,  the  Dynamic  Programming 
approach  can  be  extended  to  be  valid  over  the  entire  phase  space,  and  the  uniqueness  of 
the  value  of  the  optimal  return  function  is  guaranteed.  We  finally  show  that  with 
additional  assumptions  the  finite-time  problem  can  be  extended  to  infinite  time. 

To  begin  to  understand  the  implementation  issues  of  this  new  adaptive  controller,  the 
longitudinal  mode  of  the  F-18  was  controlled.  Since  a  global  maximum  of  the  sum  of  the 
optimal  return  function  and  the  optimal  accumulation  function  is  required,  only  the  most 
important  parameter,  the  moment  coefficient  due  to  elevator  deflection,  is  used  as  the 
adversary.  The  coefficient  of  the  other  control  variable,  thrust  vectoring,  is  assumed 
known.  As  shown  [10],  for  initial  conditions,  remarkable  performance  is  obtained  over 
current  adaptive  controllers.  In  regimes  where  the  scalar  parameter  is  quite  unknown  the 
control  emphasizes  thrust  vectoring  and  reduces  the  elevator  deflection  to  be  almost  zero. 
As  moi«  information  is  obtained  the  controller  begins  to  use  more  elevator  deflection  and 
less  thrust  vectoring  which  is  eventually  faded  to  zero.  This  is  to  be  contrasted  with 
standard  adaptive  controllers  in  which  substantial  elevator  deflection  is  used  early  even 
though  the  parameter  is  the  wrong  sign.  Therefore,  the  initial  response  is  in  the  wrong 
direction.  The  inherent  conservative,  but  intelligent,  performance  of  the  new  controller  is 
associated  with  the  two  worst  cast  states  in  which  usually  only  one  is  a  global  maximum. 
In  the  beginning  the  global  worst  case  state  dictates  a  conservative  policy  where  the 
elevator  deflection  is  made  small  and  the  thrust  vectoring  dominates  the  response.  At 
some  time,  say  tc,  both  worst  case  states  produce  identical  cost.  Here,  there  is  a  switch 
from  the  conservative  policy  to  one  similar  to  that  of  the  standard  adaptive  controller. 

2.4  Detection  and  Identification  of  Structural  Changes 

Accurate  target  motion  defection  and  identification  are  important  in  reducing  terminal 
miss  of  homing  missiles.  In  [11]  a  detection  filter  approach  to  detecting  and  identifying 
target  acceleration  was  used  when  the  relative  position  can  be  measured  using  angle  and 
range  measurements.  In  [12]  a  detection  filter  approach  to  fault  detection  is  developed 
from  an  eigensystem  assignment  approach.  The  detection  filter  is  an  observer  whose  gains 
are  constructed  such  that  the  presence  of  a  fault  induces  in  the  measurement  residuals  an 
invariant  (fixed)  direction.  Therefore,  during  a  sudden  maneuver  of  the  target,  although 
the  detection  filter’s  estimate  of  the  state  of  the  target  is  inaccurate,  the  target’s 
acceleration  direction  can  be  determined  by  examining  the  detection  filter  residuals  of  the 
position  measurement.  These  fault  detectors  are  made  robust  to  variations  in  the  system 
parameters  as  well  as  exogenous  inputs  by  modification  of  the  detection  filter  structure  or 
by  reformulating  as  a  disturbance  attenuation  problem.  During  this  grant  period  an 
alternative  approach,  called  the  multiple-hypothesis  adaptive  filter,  was  also  developed 
which  allows  a  dramatic  reduction  in  computation  over  that  of  the  bank  of  filter  concept 

Detection  filters  form  a  class  of  linear  observers  that  produce  residuals  with  known 
and  fixed  directional  characteristics  in  response  to  a  set  of  design  fault  directions.  In 
practical  applications,  reliable  fault  isolation  takes  place  only  when  the  detection  space 
structure  is  insensitive  to  system  parameter  variations.  A  left-eigenvector  assignment 
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approach  is  developed  [13]  that  allows  for  the  application  of  existing  results  relating 
supremal  controllability  subspaces  and  ill-condition^  eigenvectors.  Modifications  to  the 
detection  filter  structure  that  yield  improved  eigenvector  conditioning  and  sensitivity  to 
system  parameter  variations  arc  applied  to  an  aircraft  accelerometer  fault  detection  filter. 

An  alternate  approach  to  detection  filter  design  is  given  in  [14]  where  the  fault 
detection  process  is  approximated  with  a  disturbance  attenuation  problem.  The  solution 
to  this  problem  leads  to  an  filter  which  bound  the  transmission  of  all  exogenous  agnals 
save  the  fault  to  be  detected.  In  the  limit,  when  the  transmission  bound  is  brought  to  zero, 
a  complete  transmission  block  is  achieve  by  embedding  the  nuisance  inputs  into  an 
unobservable,  invariant  subspace.  As  this  is  the  same  subspace  structure  seen  in  some 
types  of  detection  filters,  one  can  then  make  the  claim  that  the  game  theoretic  filter 
asymptotically  becomes  a  detection  filter.  One  can  also  make  use  of  this  subspace 
structure  to  reduce  the  order  of  the  limiting  game  theoretic  filter  by  factoring  this  invariant 
subspace.  The  resulting  lower  dimensional  signal  will  then  be  sensitive  only  to  the  failure 
to  be  detected.  This  approach  also  extends  detection  filter  design  to  time-varying 
problems. 

In  [15,  Appendix],  a  new  algorithm  for  adaptive  estimation  of  time-varying  parameters 
in  certain  classes  of  linear  stochastic  dynamic  systems  has  been  developed.  The  algorithm 
is  based  on  an  adaptive  Kalman  filter  whose  hypothesized  parameters  are  modified  at  each 
stage  by  generating  the  probability  of  each  hypothesis,  conditioned  on  the  residual  history 
and  a  given  probability  of  transition.  It  is  then  shown  that  when  a  particular  hypothesis  is 
true,  the  expected  value  of  the  corresponding  posterior  probability  conditioned  on  the 
residual  history  converges  to  unity.  Moreover,  the  expected  value  of  the  norm  of  the 
difference  between  the  constructed  error  covariance  and  the  true  posteriori  error 
covariance  is  shown  to  converge  to  a  lower  bound  close  to  zero.  By  invoking  an 
information  function,  the  filter  is  also  shown  to  be  robust  with  respect  to  modeling  errors. 
A  few  numerical  simulations  have  been  performed  to  evaluate  this  algorithm  against  the 
backdrop  of  the  multiple  model  adaptive  estimation  scheme. 

2.5  Approaches  to  Disturbance  Attenuation  for  Nonlinear  System 

Robust  nonlinear  filters  and  guidance  laws  are  investigated  using  the  deterministic 
methodology  of  disturbance  attenuation.  In  our  first  approach  we  show  that  using  the 
dissapative  inequality,  the  disturbance  attenuation  bound  is  satisfied  if  certain 
modifications  are  made  to  the  guidance  law  and  certain  conditions  are  satisfied.  In  a 
different  approach  where  the  nonlinearities  are  assumed  small  in  the  dynamic  system,  an 
approximate  nonlinear  optimal  estimator  and  control  scheme  are  derived  based  on  an 
expansion  technique  for  solving  the  disturbance  attenuation  problem. 

In  [16],  for  a  class  of  nonlinear  time-varying  system,  a  stable  nonlinear  game- theoretic 
filter  (GTF),  which  solves  the  disturbance  attenuation  problem,  is  obtained  with  a 
dissipative  approach.  For  this  special  class  of  nonlinear  functions,  called  modifiable 
nonlinearities,  the  estimation  error  propagates  almost  linearly.  A  sufficient  condition  to 
render  the  dissipative  inequality  satisfi^  for  the  GTF  with  respect  to  the  worst  strategy 
for  the  process  and  sensor  disturbances  is  derived.  The  filter  gain  is  obtained  from  a 
Riccati  differential  equation  (RDE)  which  results  from  bounding  the  dissipative  inequality. 
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In  bounding  the  dissipative  inequality,  the  design  parameters  are  scaling  coefficients 
multiplying  the  weighting  matrices  in  the  RDE.  Then,  a  sufficient  condition  for  the  GTF 
to  be  asymptotically  stable  is  derived  which  is  also  shown  as  the  sufficient  condition  of  the 
GTF  to  be  dissipative.  Furthermore,  the  disturbance  attenuation  property  is  implied  by  the 
dissipativity  of  the  GTF.  Next,  for  a  class  of  dynamical  system  with  modifiable  nonlinear 
measurement  functions  but  linear  dynamics,  an  implementable  stabilizing  time-varying, 
nonlinear  game-theoretic  controller  (GTC)  is  derived  by  bounding  the  dissipative 
inequality  of  the  feedback  control  system.  This  class  includes  the  terminal  phase  of  the 
missile  guidance  problem.  The  structure  of  this  GTC  is  assumed  fiom  a  natural 
modification  of  the  corresponding  linear  quadratic  game  problem  to  accommodate  the 
above  mentioned  modifiable  nonlinearity.  Sufficient  conditions  for  both  the  dissipativity 
and  the  internal  stability  of  the  overall  measurement  feedback  control  system  are  obtained 
provided  two  coupled  RDE  are  satisfied.  These  RDEs  are  somewhat  modified  fiom  those 
obtained  from  the  corresponding  linear  quadratic  problem. 

In  [17],  over  a  finite-time  interval,  for  a  class  of  linear  time-varying  dynamical  systems 
with  a  small  nonlinearity,  an  approximate  nonlinear  optimal  estimation  scheme  is  derived 
based  on  a  deterministic  game-theoretic  criterion.  Using  the  calculus  of  variation 
approach,  this  game-theoretic  criterion  is  first  maximized  by  the  process  disturbance  and 
initial  state  vectors.  The  resulting  optimality  condition  is  expanded  with  respect  to  a  small 
parameter  e  and  the  expression  of  the  worst  case  state  and  the  Lagrange  Multiplier 
vectors  are  determined.  Subsequently,  the  approximate  game-theoretic  estimator  is 
derived  by  minimizing  each  term  in  the  series  of  cost  criterion  over  the  corresponding 
element  of  state  estimate  vector  expansion.  The  estimator  Riccati  differential  equations 
(RDE)  necessary  for  the  first  and  higher  order  correction  terms  are  the  same  as  in  the 
zeroth-order  case.  The  first-order  and  higher-order  correction  terms  are  computed  on-line 
based  on  nonlinear  functions  evaluated  along  the  minimax  trajectory  of  the  zeroth-order 
state  estimate  which  has  to  be  updated,  through  a  backward  integration,  as  each  new 
measurement  becomes  available.  The  infinite-order  approximate  minimax  estimator  is 
shown  to  be  a  priori  disturbance  attenuating.  The  Nth-order  approximate  minimax 
estimator  achieves  disturbance  attenuation  with  an  incremental  increase  in  the  bound 
proportional  to 

2.6  Near-Optimal  Missile  Guidance  For  Tactical  and  Theater  Defense 

In  [18],  a  perturbation  procedure  is  applied  to  the  problem  of  finding  an  optimal 
control  for  a  ballistic  missile  interceptor.  Certain  forces,  such  as  thrust  and  gravity,  are 
assumed  to  dominate  the  equations  of  motion.  The  optimal  control  problem  is  integrable 
if  the  remaining  forces  are  neglected;  the  approximate  effects  of  the  neglected  forces  can 
be  calculated  noniteratively  and  added  to  the  solution.  For  certain  trajectories,  however, 
the  aerodynamic  forces  are  not  negligible.  Including  the  aerodynamics  directly  in  the 
dominant  dynamics  destroys  the  analytical  solution  upon  which  the  procedure  depends. 
Instead,  approximations  of  the  aerodynamic  forces  are  included  through  narrow  pulse 
functions.  This  technique  produces  a  good  approximation  to  the  optimal  control  and  is 
computationally  more  efficient  than  previous  methods.  Extensions  to  previous  work  are 
also  made  to  account  for  the  interceptor’s  coast  phase  and  terminal  constraints.  The  near- 
optimal  guidance  law  is  used  to  produce  intercept  trajectories  against  a  number  of  target 
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trajectories.  The  approximate  trajectories  compare  well  with  numerically-generate  optimal 

trajectories. 
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Abstract 

A  new  algorithm  for  adaptive  estimation  of  time-varying  parameters  in  certain  classes 
of  linear  stochastic  dynamic  systems  has  been  developed.  The  algorithm  is  based  on  an 
adaptive  Kalman  filter  (  AKF  )  whose  hypothesized  parameters  are  modified  at  each  stage 
by  generating  the  probability  of  each  hypothesis,  conditioned  on  the  residual  history  and  a 
given  probability  of  transition.  It  is  then  shown  that  when  a  particular  hypothesis  is  true, 
the  expected  value  of  the  corresponding  posterior  probability  conditioned  on  the  residual 
history  coverges  to  unity.  Moreover,  the  expected  value  of  the  norm  of  the  difierence 
between  the  constructed  error  co\ariance  and  the  true  posteriori  error  covariance  is  shown 
to  converge  to  a  lower  bound  close  to  zero.  By  invoking  an  information  function,  the  filter 
is  also  shown  to  be  robust  with  respect  to  modeling  errors.  A  few  numerical  simulations 
have  been  performed  to  evaluate  this  algorithm  against  the  backdrop  of  the  multiple  model 
adaptive  estimation  (  MMAE  )  scheme. 


1  Introduction 

We  consider  a  class  of  adaptive  estimation  problems  wherein  the  unknown  system  model  may  correspond 
to  one  of  the  specified  models  and  the  model  uucei*tainty  is  summarized  as  a  time-varying  parametric 
uncertainty.  In  pairticular,  we  concern  ourselves  with  estimation  in  linear  stochastic  systems  with  time- 
varying  parameters.  Earlier  attempts  to  tackle  this  problem  resulted  in  the  development  of  the  Multiple 
Model  Adaptive  Estimation  (  MMAE  )  algorithm,  fir.st  proposed  by  [1]  and  later  generalized  by  [2]  to  form 
the  framework  of  partitioned  algorithm^^  .  This  algorithm  addresses  the  most  basic  adaptive  estimation 
problem,  viz,  estimation  in  a  linear  stochastic  system  with  time-invariant  parametric  uncertainty.  It 
is  a  joint  estimation  and  system  identification  algorithm  consisting  of  a  bank  of  Kalman  filters,  each 
“matched”  to  one  hypothesis  and  aii  identification  subsystem,  which  maybe  construed  as  the  dynamics  of 
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a  sub-optimal  multiple  hypothesis  Wald’s  Sequential  Probability  Ratio  Test  (  WSPRT  ).  We  denote  the 
underlying  dynamics  of  the  WSPRT  by  ,  which  is  defined  as  the  posterior  probability  of  hypothesis  Tli 
conditioned  on  the  residual  history  upto  tk-  The  use  of  is  motivated  by  the  implicit  assumption  that 
we  are  dealing  with  a  time-invariant  parametric  uncertainty.  However,  as  stated  in  [3],  there  is  no  rigorous 
proof  that  the  posterior  probability  associated  with  the  true  model  will  converge  to  unity.  Moreover,  apart 
from  being  computationally  intensive,  this  algorithm  suffers  from  beta  dominance  [4],  which  arises  out  of 
incorrect  system  modeling  and  leads  to  irregular  residuals. 

Recently,  there  have  been  efforts  to  improve  the  performance  of  the  MMAE  algorithm  [5].  Recall 
that  the  recursive  relation  for  the  generation  of  F^  does  not  allow  for  transitions  from  one  hypothesis  to 
another  :  It  can  be  shown  that  if  the  conditional  probability  of  a  particular  hypothesis  becomes  unity/zero, 
it  stays  at  unity/zero  irrespective  of  what  the  correct  hypothesis  is.  To  avoid  this,  the  recursive  relation  was 
modified  by  upper  and  lower  bounding  the  conditional  probabilities  of  all  hypotheses.  Secondly,  in  an  affort 
to  remove  beta  dominance,  the  conditional  density  functions  were  altered  by  removing  the  covariance  term 
from  the  denominator.  The  probabilities  still  sum  to  one,  though  the  “density”  functions  are  no  longer 
scaled.  However,  there  appears  to  be  no  rigorous  theoretical  justification  for  both  these  procedures. 

We  develop  a  new  eJgorithm  based  on  a  single  adaptive  Kalman  filter  wherein  the  time- varying  param¬ 
eters  are  updated  by  feeding  back  the  posterior  probability  of  each  hypothesis  conditioned  on  the  residual 
process.  It  is  then  shown  that  the  expected  value  of  the  true  posterior  probability  coverges  to  unity  and, 
under  certain  assumptions,  the  expected  value  of  the  norm  of  the  difference  between  the  constructed  error 
covariance  and  the  true  posteriori  error  covcuriance  converges  to  a  lower  bound.  It  is  also  shown  that  in  the 
presence  of  modeling  errors,  the  filter  converges  to  the  hypothesis  which  maximizes  a  certain  information 
function.  We  also  make  a  comment  about  the  extension  of  the  MMAE  algorithm  by  using  the  dynamics 
F^^  of  a  multiple  hypothesis  Shiryayev  sequential  probability  ratio  test  (  MHSSPRT  ),  which  explicitly 
allows  for  transitions  to  occur.  This  paper  is  orgainzed  as  follows.  In  section  2,  we  form  the  framework 
of  the  time-varying  parameter  estimation  problem.  In  section  3,  we  highlight  the  salient  features  of  the 
MMAE  scheme.  In  section  4,  we  develop  the  AKF  algorithm  and  in  .section  5,  we  derive  the  properties 
of  this  scheme.  In  section  6,  we  compare  the  two  algorithms  in  varioius  numerical  simulations.  Finally,  in 
section  7,  we  conclude  by  summarizing  the  AKF  algorithm  and  its  theoretical  properties. 

2  Problem  Statement 

Consider  a  linear  time- varying  stochastic  sy.stem  : 

®ik+i  =  Ak-Xk  +  h  + 
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Vk 


Ck^  Xk  dk  Vk 


(2) 


wherein  Xk  €  3?”  is  the  state,  yk  E  3?^  is  the  measurement,  bk  G  3?”  and  dk  €  Re^  are  bias  vectors,  while  Ak 
and  Ck  are  given  matrices  of  appropriate  dimensions.  Under  each  hypothesis  the  process  noise 
and  meaisurement  noise  {vjt}  sequences  are  white,  with  the  following  statistics  : 


Vk  ^  M  {Q,  Vi) 
Wk  -  JViO.Wi) 


Ak  =  Aki 
Ck  =  Cki 


bk  =  bi 
dk  =  di 


(3) 

(4) 


Note  that  instead  of  parameterizing  the  noise  statistics  and  other  matrices,  we  have  hypothesized  them. 
Clearly  they  are  equivalent. 

Now,  as  a  particular  application,  we  can  reduce  the  problem  of  detection  and  isolation  of  the  occurence 
of  a  change  in  a  correlated  measurement  sequence,  by  assuming  an  ARMA  model  for  the  measiureraent 
process.  Assuming  the  AR-coefficients  to  be  time-varying,  we  can  formulate  a  state-space  equivalent  of  the 
ARMA  process  as  : 


Xk-^i  =  Ak  •  Xk  +  bk  +  ^Vk 
Vk  =  Ck'  Xk  ^  dk  +  Vk 


(5) 

(6) 


wherein  yk  €  3f?^  is  the  measurement,  Ck  =  [  yk-i\  •  •  •  IVk-n  ]  is  the  measurement  matrix,  Xk  €  3?^  are  the 
AR-coefficients,  Ak  is  a  given  matrix  and  bk  and  dk  are  appropriate  bias  vectors.  Again,  from  (3)- (4),  the 
process  and  sensor  noise  sequences  are  white  with  different  statistics  under  different  hypotheses. 

In  any  case,  the  problem  maybe  stated  as  follows  :  Identify  the  current  system  model  in  minimum  time, 
by  detecting  and  isolating  a  change  in  the  measurement  process.  As  stated  earlier,  all  existing  algorithms 
have  an  identification  subsystem  imbedded  in  them  :  this  subsystem  maybe  construed  as  the  recursive 
relation  for  FJf-  ,  which  assumes  that  no  change  occurs  in  the  measurement  process  when  the  test  is  in 
progress.  However,  in  oiu*  AKF  algorithm,  we  explicitly  model  the  probability  of  a  transition  from  one 
hypothesis  to  another,  thereby  allowing  for  time-varying  hypotheses  and  using  the  recursive  relation  for 
^ki  I®]*  extend  the  MMAE  algorithm  to  tiine-vaiying  hypotheses  by  using  this  instead  of  the 

bounded  Finally,  we  develop  sufficient  conditions  for  the  convergence  of  this  adaptive  filter  structure. 

3  MMAE  Algorithm 

The  Multiple  Model  Adaptive  Estimation  algorithm  and  its  variations  are  widely  used  for  parameter 
estimation  in  linear  stochastic  systems  [3][4].  Let  there  be  L  +  1  linear,  discrete-time  stochastic  dynamic 
systems,  etich  generating  measurements  corrupted  by  white  noise  :  therefore,  the  available  measmement 


3 


sequence  maybe  aissumed  to  correspond  to  one  of  the  m  different  hypotheses.  The  sensor  and  process 
noise  statistics  vary  with  each  hypothesis.  One  can  then  construct  a  bank  of  L  +  1  discrete-time  Kalman 
filters,  each  “matched”  to  one  hypothesis,  generating  a  white  residual  process,  provided  the  corresponding 
hypothesis  is  the  true  one.  The  residual  process  becomes  the  input  to  the  recursive  relation  for  which 
generates  the  posterior  probability  of  eaich  hypothesis,  conditioned  on  the  measurement  sequence.  This 
leads  to  a  neat  parallel  structure,  shown  in  figure  1. 


Figure  1:  Multiple  Model  Adaptive  Estimation  -  Lainiotis  Filters 

The  update  equations  for  a  generic  Kalman  Filter  for  hypothesis  Hi  become  : 

Ski  =  Cki  ■  Mki  ■  Cki^  +  k, 

Kki  =  Mki-Cki'^  Ski-^ 

d^ki  —  ^ki  "h  ^ki  ‘  [  I/fe  ""  ^ki  '  ^ki  ~  dki  ] 


Pfci  =  [I-Kki-Cki]-Mki  00) 

f/fci  =  Vk  —  Cki  ■  Xki  -  dki  01) 

Tiki  =  [ni---rki]  02) 

Tlk  =  [Tlki--‘TlkM]  03) 


wherein  Mki  >s  the  apriori  error  covariance  matrix,  Pki  is  the  posteriori  covariance  matrix,  Xki  is  the  apriori 
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state  estimate  and  x^i  is  the  posteriori  state  estimate  at  time  The  propagation  equations  become  : 

“  ^ki  '  ^ki  d"  ^ki 

Mk+i,i  =  Aki-Pki-Aki'^  +  Wi  (15) 

In  the  cited  literature  and  in  the  figure  shown,  is  generated.  However,  allowing  transitions  firom  one 
hypothesis  to  another,  we  generate 

Fij  ^  PinjiTik) 

The  overall  posteriori  state  estimate  and  error  covariance  become  : 

j 

Pk  =  Pkj  +  (  4  -  -  Xkj  V  }  ■  (17) 

j 

We  note  the  following  : 

•  The  noise  characteristics  of  each  filter  are  time-invariant. 

•  As  the  number  of  hypotheses  increases,  the  algorithm  becomes  computationally  intensive,  as  one 
needs  to  compute  all  the  time-varying  filter  gains.  To  alleviate  this  problem,  sometimes  the  steady- 
state  gains  of  each  Kalman  filter  are  used,  instead  of  the  time-varying  gains  [3].  But  this  can  lead  to 
convergence  to  the  wrong  hypothesis. 

•  There  is  no  rigorous  proof  that  in  the  posterior  probability  associated  with  the  correct  hypothesis 
will  converge  to  unity. 

•  The  recursive  relation  for  or  assumes  that  the  residual  secpience  is  conditionally  independent, 
but  when  Hi  is  true,  TZ^j  is  not  conditionally  independent  V  j  i.  Hence  the  generated  F^^  or 

is  always  wrong  no  matter  what  the  correct  liypothesis  is. 

•  Under  certain  circumstances  [3],  the  algorithm  leads  to  the  convergence  to  the  wrong  hypothesis. 
This  phenomenon  has  been  termed  as  beta  dominance  in  [4]. 

3.1  Beta  Dominance 

Let  “Hi  be  true  :  Then,  one  would  expect  the  residual  process  rti  to  be  small  while  the  residuals  of  the 
“mismatched”  Kalman  filters  to  be  large.  If  for  some  unknown  rea.son  (  choice  of  wrong  noise  statistics, 
for  example  )  this  doesn’t  happen,  it  can  be  shown  that  the  posterior  probability  of  Hi  might  actually 
decrease,  depending  upon  Skj  V  j.  Refer  aijpendix  A  for  the  proof. 
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Figure  2:  Adaptive  Kalman  Filter  Algorithm 

4  Adaptive  Kalman  Filter  Algorithm 

We  formulate  an  algorithm  based  on  a  structure  which  uses  a  single  adaptive  Kalman  filter  in  conjunction 
with  the  recursive  relation  for  Consider  figure  2  :  An  approximate  posterior  probability  F*,  of  each 
hypothesis  conditioned  on  the  residual  history  is  generated  and  fed  back  to  the  filter.  All  the  bias  vectors 
and  system  matrices,  including  the  process  and  sensor  noise  statistics,  are  updated  in  the  following  way  : 


Vk 

~  ATiO^Vk) 

Wk  ~  Af  {0,  Wk) 

Itk 

=  [  To  ri  .  .  .  Tfc  ] 

(18) 

Fki 

=  approximate 

Fk  =  [F,o  Fki...FkLf 

(19) 

Ak 

=  E,  rti  ■ 

w 

II 

Wk 

II 

(20) 

Ck 

=  E,  •  Cj 

dk  =  Fk-ij  •  dj 

Vk 

i-I 

1 

w 

II 

(21) 

We  derive  sufficient  conditions  for  the  convergence  of  F*:j  to  Fl!j  in  the  next  section.  As  mentioned  earlier, 
the  structiure  of  the  MMAE  algorithm  never  permits  the  exact  calculation  of  or  F^^ .  Note  that  ,  bf; 
and  Wk  are  updated  using  Fkj,  as  it  is  already  available.  That  is  not  the  case  for  Ck  ,  d*  and  14 .  However, 
this  does  not  produce  any  difference  in  the  theoretical  results  presented  later  on  in  Section  5.  The  filter 
equations  remain  the  same  except  that  we  remove  the  subscript  i  from  equations  (7)-(15).  Therefore  : 


5* 

=  Ck-Mk-Ck'^  +  Vk 

(22) 

Kk 

=  Mk-Ck'^-Sk-^ 

(23) 

Xk 

=  xk  + Kk-[yk- Ck- xk~  dk] 

(24) 

Pk 

=  [J-Kk-Ck]-Mk 

(25) 

Tk 

=  Vk-  Ck  -Xk—dk 

(26) 
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^k+l 


^k-  ik  +  bk 


(27) 

(28) 


Mt+l  =  -^k  ■  Pk  ■  ■AJ  + 


The  true  error  covariance,  Mk  ,  of  this  sub-optiinal  state  estimate  is  not  computed  in  this  algorithm  as  it 
requires  the  knowledge  of  the  correct  hypothesis  :  instead,  as  shown  later  on,  it  is  approximated  by  Mk- 
Of  course,  if  Mi  is  true,  we  can  compute  Mk  in  the  following  way  : 


ek  =  —  Xk 

ruk  =  E  {xk  /  Mk} 

Xk  =  E{xk-xlink} 
Pk  =  E{h-il/nk} 

Clearly  : 


Cfc  =  Xk  —  Xk 

ml  =  E{ek/nk} 

Ek  =  E{xk-el  /  Uk) 
Ek  =  Eixk-el/TZk} 


ml  =  E{ek/nk} 

Ek  =  E{xk-el/Mk} 

(29) 


ik  =  {I-Kk  Ck)-ek  -  Kk-(Cki-Ck)  xk  -  Kk  idk,-dk)  -  Kk-vk  (30) 

ek+i  =  Ak  ■  h  +  [Aki  -  Ak)  ■  Xk  +  {bki  -  bk)  +  Wk  (31) 


The  difference  equations  become  : 


Pk 

=  {I- 

Kk  -  Ck)  -Mk-{I-Kk-  Ck)'^  +  Kk  -  {Cki  -  Ck)  - 

Xk  ■  {Ckr  -  Ck)'^  ■ 

Kl 

+  Kk  ■  {dki  —  dk)  ■  {dki  —  dk)^  ■  KJ  +  Kk  Vi  ■  KJ 

-  2 

■  {I  -  Kk-Ck)  -  El  -  [Cki  -  Ck)^  -  Kl  -  2{I-Kk-Ck)-ml-idki- 

-dk)'^- 

Kl 

+  2 

■  Kk  -  {Cki  -  Ck)  ■  mk  -  (dki  -  dk)'^  -  Kj 

(32) 

=  Ak  ■ 

Pk-  AJ  +  {Aki  -  Ak)  -  Xk  -  {Aki  —  Ak)^  +  (t/;:;  - 

bk)  ■  {bki  —  bk)^  + 

W, 

+  2 

■  Ak  -  EJ  -  (i4ij  —  Ak)^  +  2  •  Ak  -  inl  •  {bki  —  bk)"^ 

+  2 

■  -Ak)-Tnk-  {bki  -  bk)'^ 

(33) 

Xk+i 

-  Xk  -  Ali  +  bki  -bli  +  Wi  +  2-  Aki  -  rrik  -  bj. 

(34) 

Ek 

=  Ek- 

{I-Kk-  Ck)'^  -  Xk  -  {Cki  -  Ckf  -  Kl  -  mk  -  Kk  -  {dki  -  dk) 

(35) 

Ek+i 

= 

-Ek-Al  +  Aki  -  Xk  -  (>liti  -  Ak)'^  +  Ak  -  ml  ■ 

+  Aki  -  mk  -  {bki  -bk)^  +  (^it.  -  Ak)-  ruk  -  bji  +  bki  - 

(iiti  -  bk)'^  +  Wi 

(36) 

^k+l 

=  Aki 

■mk  +  bki 

(37) 

ml 

=  (7- 

Kk  ■  Ck)  -ml  -  Kk  -  {Cki  -  Ck)  ■  THk  -  Kk  ■  {dki 

-dk) 

(38) 

Pnl+i 

=  Ak  - 

ml  +  {Aki  -  Ak)-  mk  +  {bki  -  bk) 

(39) 
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The  initial  conditions  become  : 


Xo  =  Po  +  ^Q-^o  JSb  =  Po 

mo  =  xq  ^0  =  0  ♦ 

These  equations  are  computationally  intensive,  but  can  be  computed  off-line  to  assess  the  performance 
of  the  AKF  algorithm  for  each  specific  application.  In  certain  cases,  the  off-line  computation  becomes 
necessary  to  analyze  the  steady  state  behavior  of  the  AKF  algorithm,  in  particular,  its  convergence  to  the 


correct  hypothesis.  Given  ‘Hi,  the  true  distribution  of  rt  is  : 

IkiiTk)  =  h{rkini,nk-x)  (40) 

E  \_Tk  /  Hi , —  {Pki  —  Ck)'Xk  +  {dki  —  dk)  =  hki  ('ll) 

E  [fk  •  fT  !  ^  P'k-i  ]  =  Cki-  Mk-  Cki^  +  Vi  =  Ski  ('12) 

Therefore  : 

( rfc  /  "Hi ,  Hk-\  )  ^  [hi  ^  Ski)  "  ■  (43) 


However,  since  the  correct  hypothesis  is  unknown,  the  AKF  algorithm  is  designed  to  approximate  the 
true  error  covariance  Mk  with  the  error  “covariance”  Mk  ,  computed  on-line  from  (25)  and  (28).  In  section 
5,  this  assumption  is  justified  by  showing  that  under  certain  conditions,  E  {Mk  f  Hi)  -4  Mki ,  wherein 
Mki  is  the  apriori  error  covariance  corresponding  to  the  i**'  filter  in  the  MMAE  algorithm.  It  is  also  shown 
that  E  {Mk  !  Hi)  Ml  ,  wherein  Ml  is  the  exact  apriori  error  covariance  derived  from  (17)  and 
E  {  Mk  /  Hi)  Mk  ■  At  each  tk,  we  assume  : 

E[rk/Hi,nk-i]  =  0 

E[rkTl  /Hi,nk-i]  =  Cki-Mk-Cki^  +  Vi  =  \ki  (44) 

Under  the  gaiissian  a.ssumption,  we  explicitly  construct  the  density  function  fki  (•)  as  : 
fkiirk)  =  Approx.  fki{rk) 

irklHi,nk-i)  ~  MiO,Aki)  (45) 

We  make  a  crucial  observation  that  {r/t}  is  no  longer  an  independent  residual  process.  Now,  P^i  requires 
the  knowledge  of  the  density  functions  fki  (•)  :  since  we  ai)proximate  these  functions  with  fki  (•) ,  P/ti 
been  approximated  by  Fki- 

Note  that,  by  removing  the  parallel  structure  of  the  MMAE  api>roach  while  retaining  the  time-varying 
filter  gain,  the  AKF  algorithm  is  computtiationally  le.ss  intensive,  especially  if  the  number  of  hypotheses  is 
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large.  Further,  following  the  argument  used  in  appendix  A,  it  can  be  shown  that  beta  dominance  cannot 
exist  in  this  structure  as  there  is  only  one  residual  process  here.  In  the  next  section,  we  address  issues 
concerning  the  convergence  and  robustness  of  this  algorithm. 

5  Performance  of  AKF  Algorithm 

We  now  consider  the  convergence  properties  of  this  algorithm.  We  propose  to  prove  the  following  : 

•  If  hypothesis  Tii  is  true,  then  the  generated  posterior  probability  of  Hj  (  called  )  decreases 
monotonically  \/  j  ^  i. 

•  If  hypothesis  Tii  is  true,  then  the  posterior  error  covariance  P/t  of  the  AKF  algorithm  converges  to 
Pki  of  the  ‘^correct”  filter  from  the  MMAE  algorithm. 

•  If  hypothesis  T-Li  is  true,  but  is  not  included  in  the  set  of  prol)able  hypotheses  ©  =  {  H j  },  then,  the 
generated  posterior  probability  converges  to  that  hypothesis  Hjn  €  0  which  maximizes  a  particular 
information  function. 

5.1  Underlying  Assumptions  of  MHSSPRT 

We  briefly  rederive  the  MHSSPRT  [6]  by  defining  the  following  notation  : 

TTi  =  P{n^) 

Pi  =  Apriori  probability  of  change  from  Ho  to  Hi  from  t^  to  tk+i,  Vk 
fkii’)  =  .Approximate  probability  density  function  of  r^  conditioned  on  Hi  and  7^k-i 

=  Approximate  probability  density  function  of  rk  mnditioned  on  Ho  and  'Pk-i 

L  -h  1  =  Number  of  hypotheses 

6i  =  Time  of  occurence  of  H\ 


At  Stage  ti  : 


P{0i<ti/ri)  = 

Pin)  = 

Piei<h)  = 


P{ri/e,<ti)-p{ei<ti) 

Piri) 

L 

E  ^  ^i)  •  ^  +  Pin/Oi  >ti)-  P{0i  >  h) 

i=\ 

P{0,  <  to)  +  P{ei  =  h/Oi  >  to) 


=  TTj  +p,  •  (1  -  TTi) 


(46) 


P{ri/6i<ti)  =  fu{r\)-dri 
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(47) 

(48) 

(49) 


Strictly,  (49)  denotes  the  probability  that  the  nieasureinent  lies  between  ri  and  ri  +dr\  given  the  occiurence 
of  Hi  at  or  before  ii- 


t=i  »=i 

From  (46),  we  get  : 

_  _ _ [TTi+Pi-  (1  -7ri)l-/ii(ri) _ 

~  Ef=i[’ri  +Pi  •  (1  -  TTi)]  •  hiin)  +  [1  -  Ef=i  +  Pi  •  (1  -  iTi)]  •  /lo(ri) 
At  Stage  2  : 

Pin^lOj  <  fz)  •  P{0i  <  t2) 

P{P-2) 

P{r2/0i<t2,ri)  =  /2i(r2)-dr2 

P(gi<<2/ri)-P(ri) 

P{0i  <  h) 

P(^2)  =  P(r2/ri)P(ri) 


P(ei<t2/P-2)  = 
2/0i  <  h  y  n)  = 
P{nie<t2)  = 


Since  from  (45),  we  know  the  density  function  !  Hi ,  Hk-\ )  from  (52)  : 


P{0i  <  <2/'^2)  = 


P(r2M  <  <2  ,  ri)  •  P(ri/g^  <  <2)  •  P{0i  <  h) 


P(P2) 

Now,  from  (53), (54), (55),  we  have  : 

/2i(r2)  •  P{0i  <  ^i/^l)  •  dr2 
P(r2/ri) 

P{9i  <  ^2/^1)  —  P(^i  ^  ^i/n)  +  P(^i  =  ^2/^i  >  ^Oi^'i) 


P{9i<  12/^2)  = 


=  Pl,i+Pz-(1-Pl,i) 


(50) 

(51) 

(52) 

(53) 

(54) 

(55) 

(56) 

(57) 

(58) 


P(r2/ri)  =  E ^  *2)  *  P{0i  >  <2/n) 

i=l 

L 

t=i 

+  [1  -  1]  +  Pi  ■  ~ 

i=l 

Clearly,  by  induction,  we  can  now  write  the  reclusive  relation  for  Fk+i,i  in  terms  of  Fk,i  ■ 

_ _ [Fk,i+Pi-il-Fk,i)]-fk+,A^k+i) _ _ 

Pfc+i,i  -  i  +  Pi  •  (1  -  Fk,i)]  •  fk+iAn+i)  +  [1  -  Ef=i  Pit,i  +  Pi  •  (1  -  Pfc,i)]  •  A+i,o(T-fc+i) 

Po.  =  -. 

Nowhere  have  we  made  any  assumptions  about  the  independence  of  the  residual  process.  From  (45),  we 
explicitly  construct  fki{  )  V  i  at  each  tk-  This  approximates  P^j  by  Fki  as  mentioned  in  the  earlier  section. 
In  the  next  section,  we  derive  sufficient  conditions  for  the  convergence  of  Pjti  and  the  associated  error 
covariance. 
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5.2  Convergence  of  the  Posterior  Probability 

We  seek  to  prove  that  when  Hi  is  true,  the  posterior  probabilities  of  all  hypotheses  Hj  j  ^  i  decrease. 


We  define  the  following  : 

T  t  {Sir/n)  :  -Hce}  (61) 

=  E[\n{fkj(rk)}/'Hi,nk-i]  (62) 

Pjmi  =  mpc  E  [  [  ^  ]  for  some  t  €  (0, 1) 

=  pjmUT^k-^i)  (63) 

Assumption  1  The  family  of  density  functions  T  tv  identifiable,  i.e  : 

f{r/Hi)  =  f{r/Hj)  iff  =  0^  Vr  (64) 

This  eissumption  is  invoked  to  prove  Assertion  1. 


Assertion  1  In  view  of  assumption  1,  beta  dominance  cannot  exist  in  the  AKF  algorithm. 

Proof  :  Refer  Appendix  Bl. 

In  an  effort  to  illustrate  the  classes  of  problems  for  which  Jji  is  an  information  function,  we  consider 
a  time-varying  ARMA  process  of  order  n,  wherein  the  measurement  noise  is  different  for  each  hypothesis. 
Therefore  under  each  hypothesis  Hi  ,  the  process  noise  and  measurement  noise  {vjt}  sequences  are 

white,  with  the  following  statistics  : 

VA:  ~  ( 0 ,  Vi  )  Ak  =  Ak  hk  =  bk 

Wk  -  Ar(0,W,  )  C,  =  Ci,  4  =  4  (65) 

wherein  Ck  =  [  iJk-il  ♦  *  •  1//^--'-.  ■  measurement  matrix.  We  now  ])rove  the  following  lemma. 

Lemma  1  LfCt  Hi  be  true.  Then,  for  the  ARMA  process  shown  in  (65)  : 

•  If  Vi  >Vk  and  Mk  >  Mk  ,  then  Mk-\-i  >  M^^i  V  A:. 

•  If  Vi  <Vk  andMk  <  Mk  ,  then  Mk+i  <  V  fc. 

Proof:  From  (32),  (33),  (25)  and  (28) 

Pk  =  {I-K,C,)MkiI-Kk-C,f  +  K,-Vi-K'[ 

Pk  =  (I-  Kk  •  C:)  Mkil-  K,  •  Ckf  +  Kk  ■  V,  ■  Kl 

•  A-  • 

A4+1  =  Ak  ■  Pk  ■  aJ  +  Wk 
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Therefore  : 


Pk  -  Pk  =  {I-Kk-  Ck)  •  {Mk  -  Mk)  •  (/  -  Kk  ■  Ckf  +  Kk  •  {V-  -  Vit)  •  Kl 
Mk+i  -  A^fc+i  =  ^k  •  {Pk  -  Pk)  ■  Ak 

Clearly  : 

Vi  >  Vk  ,  Mk  ^  ^k  ^ifc+i  >  Mk+i  V  k 

Vi  <  Vk,  Mk  <  Mk  =>  Mk+i  <  Mk+i  V  fc  4 

As  a  consequence,  from  (22), (42)  and  (44)  : 

Vi  >  Vj  =?►  ^kj  <  A/ti  <  Ski  Vk 

Vi  <  Vj  =>  Akj  >  Aki  >  Ski  V  k 

We  could  consider  another  class  of  problems  similar  to  (65)  wherein  the  hypotheses  differ  only 
process  noise  statistics,  i.e  ,  under  “Hi  ■ 

Wk  ~  A/"  (  0  ,  Wi ) 

W'e  now  prove  the  foilwing  lemma. 

Lemma  2  Let  'Hi  be  true.  Then,  for  the  process  .shown  in  (67)  : 

•  IfWi  >Wk  and  Mk  >  Mk  ,  then  Mk+i  >  Mk+i  V  A-. 

•  If  Wi  <  Wk  and  Mk  <  Mk  ,  then  Mk.^.\  <  Mk+\  V  k. 

Proof  :  The  proof  is  very  similar  to  the  one  of  Lemma  1.  Rom  (32),  (33),  (25)  and  (28) 

Pfc  =  (I  -  Kk  -  Ck)  -  Mk  ■  {I  —  Kk  ■  Ck)^  4-  Kk  -  Vk  ■  KJ 
Pt,  =  {I-Kk-  Ck)  -Mk-il-Kk-  Ck)^  +  Kk  ■  Vk  •  Kf 
Mk+i  =  Ak-Pk-Aj  +  Wi 
Mk+i  =  Ak  -  Pk  -Al  -^-Wk 

Therefore  : 

P,  -  P,  =  {I-Kk-Ck)-{Mk-Mk)-{I-Kk  CkY 

Mk+i  -  Mk+i  =  Ak  ■  {Pk  -  Pk)  -  Ak  +  (  -  Wk  ) 


(66) 
in  the 

(67) 


12 


Clearly  : 


Wi  >  Wk  ,  Mk  >  Mk  Mfc+i  >  Mk+i  V  k 

Wi  <  Wk  ,Mk<Mk  =>  Mk+i  <  Mk+i  V  fc  X 

So,  from  (22), (42)  and  (44)  : 

Wi  >  Wj  =>  Xkj  <  ^ki  <  Ski 

Wi  <  Wj  =>  Akj  >  Aki  >  Ski  Vfc 

These  results  are  used  in  proving  the  following  assertion. 

Assertion  2  If  the  state  estimate  bias  is  sufficiently  small  and 

A.f^j  <C  Ajtt  Sji^{  or  ^kj  ^  ^ki  ^  ^ki  ^  ^ 
then  Jji  15  an  information  function.  Therefore,  when  Hi  v>  tine  : 

Jii{k)  >  Jji{k)  jj^i  ,  W  k  (68) 

Proof  :  Refer  Appendix  B2. 

Remarks  Assertion  2  assumes  that  convergence  to  the  wrong  hypothesis  has  occiurred  and  proves  that 
under  certain  conditions,  the  filter  cannot  remain  in  the  wrong  hypothesis.  The  conditions  spelt  out  are 
sufficient  but  not  necessary.  Moreover  for  the  {)roce.s.ses  shown  in  (65)  and  (67),  the  state  estimate  is 
unbiased  and  from  Lemmas  1  and  2,  Assertion  2  is  always  valid. 


Lemma  3 

Let  Hi  be  true  and  Hm  ^  Q  be  such  that  : 

Jmi  {k)  =  max  Jji  {k)  ^  Tij  e  Q 

3 

V  k 

(69) 

Then 

Pjmi  <1  V  jf  7^  rn 

(70) 

Proof  :  Refer  Appendix  B3. 

For  the  classes  of  problems  that  we  consider,  in  view  of  As.sertion  2  and  Lemma  3  : 

If  Tfi  e  0  =i>  Hi^Hm 

=>  Pjii  <1  (71) 

From  now  on,  we  only  consider  the  classes  of  problems  for  which  As.sertioii  2  is  valid.  We  now  proceed 
to  prove  the  following  theorem. 
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Theorem  1  Let  the  family  7  be  identifiable  and  Hi  €&  be  true.  Then  : 


E[Fk,j/ni]  <  E[Fk.ij/ni]  ^jj^i  172) 

Proof  :  Let  hypothesis  Hi  eQ  he  true.  Since  Fjtj  <  1,  we  have  : 

Fkj  <  Flj  for  some  t  €  (0, 1)  (73) 

This  is  a  crucial  observation  and  is  invoked  to  use  pju  <  1  from  (71).  All  the  terms  in  the  denominator 
of  the  recursive  relation  (60)  are  postive  :  therefore  we  get 

p  ^  {  Fk-ij  +  Pi  •  (  1  -  Fk-I,j  )}■  fkj  jrk)  (74) 

—  {  Fk-l,i  +Pi  •  (  1  ~  Fk-l^i  )}  ■  fki  (^fc) 

,  A  {  Fk-Ui  -bPi  •  ( 1  -  Fk-I,j  )  }  (75) 

-  {Fk-k^+Pi{l-Fk-u)} 

We  note  that  the  density  fimctions  fkji-)  are  the  approximate  conditional  density  functions  as  defined  in 
(45).  Assume  that  Pj  =p  V  j.  Let 

Fk-ij  >  Fk-i,i  j  k  <  N 


Then  : 


p  +  ( 1  -  p )  •  Fk-i,j 

-  p+{l-p)-Fk-ij 

Fk-i,j  + 

Fk-l,,  (1-P)+Ffc_i., 

<  Fk-ij 

Fk^i^i 

(76) 

From  (73), (75). (76)  : 

„  ^  r  Fk^lJ  w  r  fkj  i'f'k)  -|£ 

<  '  /« in) ' 

^  r  Fk-2J  i£  r  /fc-lii  If  r  A'i  (^fc)  if 

-  'f*_2,,  '  ‘A-Mh-i)' 

.  r  It  r  flj  (^l)  It  r  fk}  (^<=)  1« 

(77) 

We  note  that  fki  (  rt  )  are  the  constructed  density  functions  fk  (  Vk  /  Hi  ,  Hk-i  )  in  (42).  Taking  the 


expectation  conditioned  on  Hi  ; 

=  [^Yh 

TTv 


(78) 
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wherein  Ik  is  the  integral.  Now  : 


fkii^kini)  =  fki{rk/nk-i,'Hi)-fk-i,i{rk-i/nk-2,'n^)...fu{ri/ni) 


■J 
=  / 


fij  jri) 
(ri) 


fk-\,i  (fk-l)  f 


fk-l,i 


yfk-iA'^k-i/ni)-dRk-i 


[  •  Ai  (rt  /  Tlk-uni )  ■  dn 

Ski  (n) 

[  T^]‘  •  •  ■  [  -  A_i,i  ( Tlk-i  I  Hi)-  dUk-i  ■  Pjii  ( TZk-i ) 

Jli  (n)  fk-l,i  (rk-l) 


<  h-1  •  Pjii 


Therefore,  from  (78)  : 

TVi 


(79) 


From  Assertion  2  and  Lemma  3,  pju  <  1.  Hence,  the  jjosterior  prol)ability  rai>idly  decrecises  as  k  oo  and 
the  aissumption  of  >  F^-ia  V  j  ^  i,y  k  is  no  longer  valid.  Let  F^-ij  <  F^-ij  V  j  iA  k  >  N. 

Then  : 


4>ji  = 


< 

< 

< 


Fk-i.j  [1  —  p  •  {IfFk-ij  —  1) 

Fk-u  '  l-p-{l/Fk.x,j-l)] 

§^-[l-p-(l/Ffc_ij-l)] 

^k-l.j  -  (1  -  Fk-I.j) 

Fk-XA  ^  Fk-XA 


Following  the  argument  used  al)ove,  we  get  : 


(80) 


Flj 


< 


Fk-:.i 

Fk-XA 


+pr-[ 


fkj  (j  k)  -If 

fki  in) 


rFk-lJ 

^Fk-x,i 


<  [[■ 

r  <  [[• 


Fk-x,j 

Fk-X,i 

Fk-2,i 

Fk-2,i 


r+p‘]-[ 


r-p']-[ 


Therefore  : 


fkj  (^fc)  -If 

fki  irk) 

fk-l,j  (u-l)  jt 
fk-\,i  (rk-l) 


pt  <-  r  Fk-2.j  y  r  fkj  jrk)  •  fk-\,j  jrk-x)  -It  Aj  (rk)  ‘  fk-l,j  (^t-l)  It 

~  Fk-2A  fki  irk)  ■  fk-i,i  irk-\)  fki  irk)  *  fk-i,i  irk-i) 

‘  fki  irk) 

Again,  taking  an  expectation  conditioned  on  Hi  and  from  (63)  : 

E[Fkj/n^]  <  iE;{[^]Vw.}-p";7''’  +  p'-[pj«  +  p"„  +  ---  +  p5.7'"] 
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&  (82) 
«+■-«  =  +  (88) 
<0  >f  k  >  Nk 

Hence  : 

€<...<  Cjt+1  <  €k  <  €fc-i  <  . . .  ^  k  >  Nk  (84) 

Clearly,  for  the  WSPRT,  p  =  0.  and  hence,  the  lower  bound  €  =  0.  ♦ 

Corollary  1  Let  the  family  T  be  identifiable,  Hi  ^  Q  be  true.  Then  : 

E  [  Fkj  /Ui]  <  E{  Fk-ij  /  -W.  ]  V  j  #  m  (85) 

wherein  Hm  maximizes  the  information  function  defined  in  (69). 


Proof :  The  proof  remains  essentially  the  same  as  at  appropriate  places  we  replace  the  subscript  i  by  m, 
except  that  now  pjmi  <  1-  The  corollary  is  relevant  to  the  robustness  issties  associated  with  any  mulitple 
model  adaptive  estimation  scheme. 

5.3  Convergence  of  the  Posterior  Error  Covariance 

Let  Tii  be  true.  We  now  compare  the  posteriori  error  covariance  matrices  of  the  adaptive  filter  ami  the 
“true”  filter  matched  to  "Hi  in  the  MMAE  algorithm.  From  (7)-(15)  and  (22)-(28)  : 

Pki  =  \I-Kk,■Ck^]■Mki  (86) 

Pk  =  [I-KkCk\-Mk  (87) 

Mk  -  Mki  =  Ak-i  ■  Pk-i  •  -  Ak-i,i  ■  Pk-i,i  ■  +  [  Wk-i  -  Wi  ]  (88) 

Therefore  : 

Pk-Pki  =  [I-KkCk]-[Mk-Mki]-[I-Kki-Ckif 

+  [  7  —  Kk  •  Ck\‘  Afjt  •  C^i  •  Tffci  —  Kk  •  Ck  •  Mki  ■  [  7  —  Kki  ■  Cki  ] 

=  [I-Kk-Ck]-[Mk-Mki]-[I-Kki-Ckif  +  Gki  (89) 

Now  from  the  definitions  of  Kk  and  Kki  hi  (8), (23)  : 

Gki  =  Kk,-Cki-Mk  -  Kk-Ck-Mk-Cli-Kli  -  Kk-Ck-Mki 
+  Kki  •  Cki  •  A/i,  •  Ck  •  K/^ 


IG 


=  Mki  ■  Cl  •  ■  Cki  -Mk  -  Mk- cl-  ■  Ck  ■  Mk  ■  cl  ■  ■  Cki  •  Mki 

-  Mk ■cl  s;^  Ck-Mki-\r  Mki •  cl ■  -Cki- Mki- cl -s;^-Ck- Mk 

=  Mki  ■  [Cl -s;:^ -Cki  -  Ck-SZ^-Ck  -  Cl-SZ^-Cki-{Mk-Mki)-Cl  -SZ^-Ck]-Mk 

Now  : 

cl-sZi^-Cki  -  Ck-sz^  Ck  =  cl-(Szl-sz^)-Ck  -  [cl-sz^-i-cl-szD  iCk-Ck,] 

Therefore  : 

Gki  =  Mki-Cl-[SZl-SZ^-Szl-Cki-{Mk-Mki)-Cl-SZ^]-Ck-Mk 

-  Mki  ■{  Cl -SZ^-h  Cl  ■SZ^)-[Ck-Cki]-Mk 

=  Mki  ■  Cl  ■  SZ^  -[Sk-Ski-  Cki  [Mk-  Mki  )Cl]-  SZ^  -Ck-Mk 

-  Mki  ■{  Cl  •  5,7  ^  +  Cl  •  5,- M  •  [  Cr-  -  C,,  ]  •  M, 

=  Kki -[Ck-Mk-Cl  +  Vk- Ck.  ■  Mki  -  Cl  -Vi-  Cki  ■  Mk  ■  Cl  +  Ch  ■  Mki -Cl]-  Kk 

-  Mki  ■{  Cl  •  5,7  ^  +  Cl  -  5,- M  ■  [  C,  -Cki]-  Mk 

=  Kk,-[Vk-Vi]-Kk  +  Kki-[Ck-Cki]-{MkCl  +  Cki-Mki)-Kl 

-Mk.-iCl  ■SZ^+Cl-SZl){Ck-Cki]-Mk  (90) 


Simlarly  from  (88)  : 


Mk  -  Mki  =  -Ak-l  ■  [  Pk-l  -  Pk-l,i  ]  •  +  (  Ak-l,i  -  Pk-\,i  +  ^A:-l  •  Pk-1  )  •  [  Ak-1  —  Ak-\,x  f 

+  [  -  Wi  ] 


From  (88)-(90)  : 


Pk—  Pki  =  [J  -  Kk-Ck]-  Ak-\  ■  [  Pk-\  -  Pk-l,i  ]  ■  Ak^ii  •  [  ^  —  f^ki  ■  Cki 

->r[I  -  Kk-Ck]  -  {  •  Pk-\,i  +  Ak-\  ■  Pk-\  )  •  [  Ak-\  —  Ak-i,i  ]^  •  [  -f  -  Kki  ■  Cki 

+  [I-Kk-Ck]-[Wk-i-Wi]-[I-Kk,-Ckif  +  Kki-[Vk-Vi]-Kk 
■\-Kki-[Ck-Cki]-{Mk- Cl  +  Cki  ■  Mki  )  ■  Kl 

-  Mki- {Cl -SZ^-^  Cl  •  5,- M  •  [  Ck  -Ck.]-Mk  (91) 


The  filter  equations  are  : 

Xki  =  [J  —  Kki  -  Cki]-  Ak-i,i  -  Xk-i,i  +  Kki  -  ivk-  Cki  ■  h-\,i  —  dki  ) 

Xk  =  [J^  —  Kk  -  Ck]  -  •  Xk-\  Kk  -  {  ijk  -  Ck  -  h-i  —  (^k  ) 
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Denote  the  state  transition  matrices  as  : 


(fc,  fc  —  1)  =  [  /  —  Kki  ■  Cki  ]  • 

$  (fc,  fc  -  1)  =  \I  —  Kk  -  Ck]  -  Ak-\ 

^ki  =  [r-KkCk]-i  Ak-i,i  •  Pk-i,i  +  Ak-i  •  Pk-i  )  •  [  Ak-i  -  Ak-i,i  f-[I-Kki-  Cki 

+  [I-Kk-Ck]-[Wk-i-Wi]-[I-Kki-Ckif  +  Kki-[Vk-Vi]-Kk 

+  Kki  •  [  Ck  ~  Cki  ]  •  (  A/fc  •  Ck  +  Cki  ’  ^ki  )  '  ^k 

—  Mki  •  (  C'J  •  +  C^i  ■  Ski  )  -  [Ck  —  Cki]  -  Mk  (92) 

Therefore  : 

6Pki  =  Pk-Pki 

=  $  (fc,  k-1)-  dPk-i,i  ■  {k,  k-1)  +  ^ki  (93) 

We  now  prove  the  following  theorem  : 

Theorem  2  If  the  system  in  (l)-(4)  is  uniformly  completely  controllable  and  uniformly  completely  observ¬ 
able,  and  if  **  uniformly  bounded  and  decreasing,  then  : 

E  {WSPkiW  / -Hi}  <  Cki 


wherein  : 

£,  <...<  Ck+\,i  <  Cki  <  A- 1,1  <  •••  k  >  Nk 


(94) 


Proof :  From  (93) 

A- 

SPki  =  ^  iK  0)  ■  SPo,  ■  (k,  0)  +  ^  $  (fc,  /)  •  ^li  •  {k,  1) 

1  =  1 
k 

=»  E  {SPk,  /Hi}  =  ^  [k, 0)  •  <5Po.  •  (A:, 0)  +  ^  ^  {k, /)  ■  E  {'I'/i  /  Hi)  •  (k,  1)  (95) 

1=1 

Since  the  system  is  uniformly  completely  controllable  and  observable  : 

\\^{k,l)\\  <  Cl  • 

||$i(A:,0ll  <  Cs  •  V  Ci ,  C2 ,  c.i ,  C4  >  0  (96) 

Further  : 

-  W^.ll  =  II E  -  ^’11 

j=0 

<  IK  1  -  Fk-i,i  )  ■  Wo  -  (  1  -  Fk-I,i  )  •  W^ill  wherein  =  max  Wj 

<  Fr-i.a-||l^a-H'.il 


(97) 


Similarly  : 


\\Ak-i-Ai\\  <  Fk-i,t,-\\A,-Ai\\ 
\\Vk-Vi\\  <  F*_a,c-||V'e-V;|| 

lie* -Gill  <  Ft-r,a-\\C^-Ci\\ 

From  (92), (97), (98)  : 


wherein  Ab  =  max  Aj 

wherein  Vc  =  max  Vj 
j 

wherein  Ca  =  max  Cj 


(98) 


EmkiW/ni}  <  ll'^ll-lkitill 


(99) 


wherein  from  Theorem  1  and  (84),  {c*i}  is  monotonically  decreasing  V  A:  >  iV*  and  bounded  from  below, 
and  ’i'  is  some  matrix  defined  from  (92).  Clearly,  from  (95)-(99)  : 


E{\mi\\/ni} 


< 

A 


IlCill-pPoill -11^311 


+  Iieiii-Ii'i'ii-I 


e-(‘^*+='‘H"-')-||£«||]-,|e3|| 


Cki 


Ck+I,i-Cki  =  ||Cil|-||d'Po.|l-|lC3||-e-(‘=^+^^^*^.[e-(‘^»+‘=^)-l^ 

+  liei||- Ill'll  •||€uM|C3||-e-(‘^^+^^>-*^ 

+  lieill  ■  ll^ll  •  [  ^  -11  _  lie, ,11}  ]  -  IIC3II 

/=1 

-  Cki  <  0  V  A;  > 


Hence  : 


Ci  <  ...  <  Ck^i^i  <  Cki  <  Ck^iA  <  ...  V  A;  >  ^ 

Remarks  :  We  note  that  the  lower  bound  is  governed  by  the  factor  p  of  the  MHSSPRT  as  it  controls  the 
lower  bound  of  the  sequence  {^ki},  as  shown  in  Theorem  1.  This  theorem,  based  upon  our  adaptive  filter 
structure,  shows  that  the  apriori  “covariance"  assumed  for  the  conditional  density  function  of  the  residual 
fki  ( '^k  )  approaches  the  true  apriori  error  covaiiance,  i.e,  E  {  Mjt  /  }  — >  Mki-  As  a  special  case,  we 

consider  the  WSPRT,  wherein  p  =  0.  From  Theorem  1,  it  can  be  seen  ; 


=>  E{\\SPki\\/ni} 

=>  E{\\sPki\\/ni} 


^  wherein  p  <  1 

<  l|C'i||-pPo.||-||C'3||-e-<‘=^-""^’‘ 

.  ^(c2+C4)*(^+1)  _  ^(c2+C4)*A: 

+  IICill  •  ll»ll  •  [- -  ..  ,...3.;:^ - 1  •  IlCsIl 


ft  ■  e':2+C4  _  1 


=  0  as  k  -4  30 


(100) 


So,  for  the  WSPRT,  Ci  =  0.  The  adaptive  filter  converges  exactly  to  the  “true”  filter  of  the  MMAE 
scheme. 
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The  assumptions  in  the  AKF  algorithm  may  be  justified  in  yet  another  way,  by  looking  at  the  exact 
expressions  for  the  overall  state  estimate  and  posteriori  error  covariance,  as  developed  in  the  MMAE 
algorithm.  Recall  from  (16)  and  (17)  : 

4  =  Y^^kj-Fkj 

j 

p;  =  51  {  Pkj  +  (xl-Xkj)-  {xl-  Xkj  )^  }  •  Fkj 
j 

Let  : 

sp^  =  p;  -  Pk 

We  now  show  that  the  expected  value  of  SP^  conditioned  on  any  hypothesis  decreases  as  k oo. 

Theorem  3  If  the  system  in  (l)-(4)  is  uniformly  completely  controllable  and  uniformly  completely  observ- 
able,  then  : 

EiimW/Hi}  <  CU  'i'Hi 

wherein  : 

£•<...<  <  Cli  <  <  ...  V  A:  >  Nk 

Proof  :  From  the  above  equations 

6Pk  =  5Z  {  {  -  Pfc  }  +  ( -  Xkj  )-{xl-  Xkj  f  }  ■  Fa; 

j 

=  51  {  ^Fkj  +  {Xk-  Xkj  )-{xk-  Xkj  )^  }  •  Fkj 
j 

=  ^Pkj  +  (  -  Xfcj  )  •  (  xl  -  Xkj  )^  }  •  Fkj 

+  {  6Pki  {x*k-  Xki )  ■  ( xl,  -  xki  )^  }  •  Fki  (102) 

Taking  the  norm  and  using  an  analysis  similai-  to  (98)  : 

+  {  \\SPki\\  +  IK  a;I  -  Xki )  •  ( 4  -  ^ki  )^*i  }  •  Fki 

PPfcjll  =  llPfci-Pfcll 

=  llP/tj-Pfci  +  Ffci-Pfcll 

<  PPfciill  +  PFjtill  <  PP,-.||  +  PA..1|  ji^i  (103) 
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rn^j 


WSXkjW  =  \\{xl-xkj)-ixl-xkjf\\ 

<  Fkjf -Wxkm-XkjW'^ 

<  {l-Fkj)'^-\\5Xj\\  (104) 

sp;:  <  E  { +  ( 1  -  )"  • 

+  {  WSPkiW  +  [l-Fkif-  PXill  }  •  Fki  (105) 

Prom  Theorem  1  and  Theorem  2,  V  fc  >  AT/t  : 

E{Fk+ij/ni}  <  E{Fkj/ni} 

E{\\SPk+i,i\\/'Hi}  <  E{\\SPki\\/ni} 

E{Fk+i,i/ni}  >  E{Fki/ni}  (106) 

Therefore  : 

E  {SP:  /Hi}  <  £?  {  C  ( +  ( 1  -  )"  •  } 

+  £  {  [  (  WSPkiW  +  ( 1  -  F,i  f  ■  px,||  ) .  Ffei]  /  -Hi  } 

=  F  {  [E  (  +  ( 1  -  Fkj  f  ■  )  •  Fkj]  /  Mi  } 

+  E{[il-Fkif-  ||<5X.||  ■Fki]/n.}  +  E{  pPfcill  /  -Hi  } 

^  r 

—  k-ki 

Now  : 

=  E  ^  lUr)  ^  E{(  ||<5PiL.^i,,||  -  \\SPk^\\  )/n.} 

j^i 

+  ^  ll^X.ll  E{{{1^  ^  ((1  -  F,jf  •  F,J)  /  Hi  }  (107) 

i 

Consider  the  function  : 

E{i{l-Fkjf-Fkj)/ni}  =  E{Fkj/ni}  +  E{F-^j/ni}  -  2-E{F^j/ni} 

<  2E{Fkj  /Mi}  -  2E{Fi^  /Hi) 

Now  E{F'ij/n.}  >  iE{Fkj/ni}f 

E{i{l-Fkjf-Fkj)/-Hi}  <  2-[E{Fkj/'Ki}  -  {E{Fkj/ni}f] 

^  ^(E{Fkj/n.}) 
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Since  <p  (•)  is  an  increasing  function  in  the  interval  [  0  .  1/2  )  and  a  <lecreasing  function  in  the  interval 
( 1/2  ,  1  ]  ,  from  (106)  and  (107),  we  have  : 

Fk+ijf  ■  Fk+ij)  /Hi}  <  £?{((!-  Ftjf  ■  Fkj)  /Hi)  ^  j  ^  i 

£;{((!-  Ft+i,i)2  •  Ffe+1,0  /Hi)  <  F  {  ((1  -  Fki?  •  F^i)  /  Hi  } 

=►  <0  k>Nk  (108) 

Hence  : 

C\<...<  £1+1, i  <  CU  <  4_i,i  <  ...  ^  k  >  Nk  ♦ 

Clearly  the  posteriori  “covariance”  of  the  AKF  algorithm  approaches  the  exact  posteriori  error  covariance 
as  computed  in  the  MMAE  algorithm. 

Finally,  we  analyze  the  difference  between  the  assumed  error  “covariance”  Pk  and  the  exact  error 
covariance  Pk  •  Let  Hi  be  true,  FVom  (25), (32)  and  (33)  : 

SPki  =  Pk  -  Pk 

=  ( I  -  Kk  Ck)  ■  Ak  -  {  Pk-i  -  Pk-i)  ■  Aj  ■  {  I  -  Kk  Ckf  +  ^k 

=  (Jt,  k-1)-  5Pk-i  ■  $  (fc,  k  -  1)^  +  %  (110) 

wherein  : 

^k  =  Kk  {  (Vk  -  Vi)  -  {Ck  -  Cki)  ■  Xk  ■  {Ck  -  Cki)'^  +  {dk  -  dki)  -  (4  -  4.)^ 

^  2  •  {Ck  -  Cki)  •  rtik  •  {dki  -  dk)^  }  •  KJ 

-  {I  -  Kk  -  Ck)  ■  {I-  El  ■  {Ck  -  Cki)^  -  2  •  ml  ■  {dk  -  dki)'^  }  •  KI 
-[I-Kk-Ck)-{  (Wi-i  -  Wi)  -  (Ai:_i  -  Ak-u)  ■  Xk-i  •  {Ak-i  -  Ak.i,i)^ 

-  {h-i,i  -  bk-i)  •  (&ik-i,t  “  h-i)^  +  2  •  Ak-i  •  Ek^i  •  {Ak-\  -  Ak-i,i)^ 

+  2  •  Ak-i  ■  mj-i  •  {h-i  - 

-  2  •  {Ak-i  -  Ak-i,i)  •  rrik-i  ■  {bk-i  -  h-i.i)^  }  •  {I  -  Kk  ■  Ck)^ 

We  now  prove  that  the  expected  value  of  SPki  conditione<l  on  Hi  decreases  as  fc  oo. 

Theorem  4  If  the  system  in  (l)-(4)  is  uniformly  completely  controllable  and  uniformly  completely  observ- 
able,  then  : 

E{\\6Pk,\\/Hi}  <  Cki  V 

wherein  : 

£i  <  ...  <  4+1, i  <  4i  <  4-1, i  <  •••  V  A:  > 
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Proof :  Prom  (110) 


k 

SPki  =  ^  (k,  0)  •  SPoi  •  (k,  0)  +  53  $  (fc,  0  ■  'I'/i  •  (fc,  1) 

/=! 

A; 

=i>  E  {SPki  /  ^.}  =  ^  {k,  0)  •  SPoi  ■  {k,  0)  +  53  $  (fc,  1)  •  E  {^H  /  -Wi}  •  (/:,  /) 

/=1 

The  rest  of  the  proof  follows  that  of  Theorem  2,  X 

This  concludes  our  analysis  to  justify  the  structure  of  the  AKF  algorithm.  Under  “H,,  we  derived 
sufficient  conditions  for  the  convergence  of  F^i  to  and  Pk  to  Pjfc ,  Pki ,  and  Pj^  .  In  the  next  section,  we 
test  the  AKF  algorithm  in  a  few  numerical  simulations 

6  Simulations 

6.1  Example  1 

Consider  a  scalcir  dynamic  system  ; 

+  bk  +  U’k 
Vk  =  Ck'  Xk  +  dk  +  Vk 

wherein  under  each  hi.'pothesis  : 


-Ho  : 

:  Ak  =  -0.5 

bk 

=  0.00 

Ck  =  1.00 

dk  =  0.00 

Vk  ~  (  0  ,  1.0  ) 

.  m 

~  AT  (0,  0.001  ) 

Hi  : 

:  Ak  =  -0.6 

bk 

=  0.25 

Ck  =  1.25 

dk  =  0.25 

Vk  ~  A'  (  0  ,  2.0  ) 

Wk 

~  {  0  ,  0.001  ) 

'H2  : 

1^ 

o 

1 

11 

bk 

=  0.50 

Ck  =  l.oO 

dk  =  0.50 

~  A  (  0  .  3.0  ) 

m 

~  ( 0 , 0.001  ) 

We  compared  the  Adaptive  Kalman  Filter  to  the  MMAE  algorithnn  In  the  MMAE  approach,  was 
replaced  by  to  allow  for  transitions  from  one  hypothesis  to  another.  Of  course,  from  our  earlier 
discussion,  it  is  clear  that  the  recursive  relation  is  not  strictly  but  an  approximation  to  it.  In  order  to 
design  the  AKF  algorithm,  it  is  essential  to  consider  scenarios  when  a  particular  hypothesis  is  true  and  the 
filter  is  “matched"  to  the  wrong  hypothesis.  An  off-line  computation  of  the  true  residual  error  covariance 
was  conducted  for  all  .scenarios.  It  is  seen  from  figure  3  that  when  'Hi  is  true  and  the  filter  is  matched  to 
"Hj  ,  either  Akj  <  Aki  <  Ski  or  Akj  >  Ajt,  >  Ski  •  Moreover  the  matrix  Bki  in  the  exponential  term 
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Figure  3:  Off-line  computation  of  Akj  ,  Aki  and  Ski  :  Wi  vs  Uj  denotes  Tii  is  true  while  the  filter  is 
matched  to  “Hj  :  Aki  is  shown  by  the  dotted  line 

is  always  positive  definite  and  so,  from  Appendix  B2,  the  system  satisfies  Assertion  2.  This  would  imply 
that  the  filter  cannot  remain  matched  to  Tij. 

We  now  test  the  AKF  algorithm.  At  t  =  40  sec,  the  hypothesis  was  changed  from  Ho  to  Hi-  The 
posterior  probabilities  of  the  three  hypotheses  are  shown  in  figure  4.  The  bold  line  denotes  the  AKF 
approach  while  the  dotted  line  denotes  the  MMAE  approach.  However,  the  computational  time  taken  by 
the  MMAE  approach  is  much  larger  than  the  AKF  approach.  These  plots  have  been  averaged  over  ten 
different  realizations. 

Figure  5  shows  the  normed  differences  between  the  posteriori  error  covariance  matrix  of  the  AKF  and 
each  of  the  Lainiotis  filters.  For  t  <  40  seconds,  Ho  is  true  ;  As  proved  in  Theorem  2,  E  {  |l(5P/:o||  /Ho}  ^0 
while  E  {  WSPkiW  /  Ho  }  and  E  {  \\SPk2\\  /  }  aie  high.  For  t  >  40  seconds.  Hi  is  true  :  Therefore 

E  {  ||<5Pjti|l  /Hi}-*0  while  E  {  ||(^P*oll  /  "Wi  }  and  E  {  ||<5P*.2ll  /  }  aie  high. 

6.2  Example  2 

Consider  another  dynanuc  system  wherein  under  each  hypothesis  : 

Ho  :  Ak  =  0.5  bk  =  0.00  Ck  =  100  dk  =  0.00 

Ufc  ~  A"(0,  1.0)  tufc  ~  AT  (0,0.001) 

Hi  :  Ak  =  0.6  bk  =  0.25  Ck  =  1.25  dk  =  0.25 
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Adaptive  Kalman  Filter  vs  Lainiotis  fitters 


Time 


Figure  6:  Adaptive  Kalman  Filter  Performance  -  Change  from  'Ho  to  H2 

Vfc  ~  (  0  ,  1.5  )  Wk  ^  Af  {0 ,  0.001  ) 

H2  ••  Ak  =  0.7  bk  =  0.50  Ck  =  1.50  dk  =  0.50 

Vi  ~  A/"  (  0  ,  2.0  )  Wk  ~  Af  {0,  0.001  ) 

Again,  we  compared  the  Adaptive  Kalman  Filter  to  the  MMAE  algorithm.  At  t  =  3  sec,  the  hypothesis 
was  changed  from  Ho  to  H2.  The  posterior  probabilities  of  the  three  hypotheses  are  shown  in  figure  6. 
The  plots  have  been  averaged  over  ten  different  realizations. 

Figure  7  shows  the  normed  differences  between  the  i)Osteiioii  error  covariance  matrix  of  the  AKF  and 

each  of  the  Lainiotis  filters. 

6.3  Example  3 


Consider 

3  hypotheses  wherein  : 

Ho 

:  vfc  ~  A7(0,  1.0) 

ni 

:  Vi  ~  A/"  (  0  ,  1.5  ) 

n2 

:  Vi  ~  AA  (  0  ,  2.0  ) 

A  fourth  order  ARMA  measm-ement  process  was  siirmlated  thus  : 

yk  =  0.1  •  [yt-i  -  yi-2  +  yit-3 -yfc-4  ]  +  «A- 


2G 


Difference  between  Error  Covariances 


Figure  7:  £;  {  IlfiPfcill  }  vs 

Since  the  order  of  the  ARMA  process  is  t3rpically  unknown,  a  fifth  order  ARMA  model  was  assumed  for 
the  measurement  process.  The  assumed  system  model  is  the  same  as  (l)-(4)  and  V  Tlj  : 

A  =  I  Wj  =  omi*T  d  =  0 

Ck  =  b  =  [OOOOOf 

Recall  from  Lemma  1  and  Appendix  B2  that  for  ARMA  processes,  A.ssertion  2  is  always  valid  thereby 
obviating  any  off-line  computation.  At  <  =  40  sec,  the  hyi)othe.sis  was  changed  from  'Ho  to  H\.  The 
posterior  probabilities  of  the  three  hypotheses  are  shown  in  figure  6.  The  i)lots  have  been  averaged  over 
ten  different  realizations. 

Figure  9  shows  the  normed  differences  between  the  posteriori  error  covariance  matrix  of  the  AKF  and 
each  of  the  Lainiotis  filters. 

6.4  Example  4 

For  the  same  system,  the  hypothesis  was  changed  from  Ho  to  H-2  at  t  =  40  seconds.  The  posterior 
probabilities  of  the  three  hypotheses  aie  shown  in  figiue  10.  Again  the  i)lots  have  been  averaged  over  ten 
different  realizations. 

Figure  11  shows  the  normed  differences  between  the  posteriori  error  covai  iance  matrix  of  the  AKF  and 
each  of  the  Lainiotis  filters. 
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Adaptive  Kalman  Filler  vs  Lahiotis  filers 

T - 1 - — I - 1 - ! - ! - 1 - T 


Time 


Adaptive  Kalman  Filter  Performance  -  Change  from  Ho  ^ 


Difference  between  Error  Covariances 


r  "  ' 

r "  '  '1  1  ' 

_ i - » - 1 - i - - 

)ir^  1  ■ —  I  I  ■ - ^ ^ - * - ■ - ^ 

0  1  0  20  30  40  50  60  70  80  90 


Figine  9:  JS  {  |l(5Pitil|  }  vs  Ir 
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Adaptive  Kalman  Fitter  vs  Lainiotis  filters 


Figure  10;  Adaptive  Kalman  Filter  Performance  -  Change  from  to 


Difference  between  Error  Covariances 


Time 


Figure  11:  E  {  pPfci||  }  vs  tk 
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7  Conclusions 


An  AKF  algorithm  and  sufficient  conditions  for  its  convergence  have  been  developed  for  adaptive  estimation 
in  linear  time-varying  stochastic  dynamic  systems.  In  the  simulated  examples,  it  is  seen  to  perform  on  par 
with  the  modified  MMAE  algorithm,  while  significantly  reducing  the  computational  intensity.  It  has  also 
been  shown  that  for  a  class  of  problems,  the  expected  value  of  the  true  posterior  probability  conditioned 
on  the  residual  history  coverges  to  unity.  In  its  most  general  form,  an  off-line  computation  is  necessary 
to  investigate  the  convergence  of  the  true  posterior  probability.  Under  assumptions  of  uniform  complete 
controllability  and  observability,  the  expected  value  of  the  norm  of  the  difference  between  the  constructed 
error  covariance  and  the  true  posteriori  error  covariance  converges  to  a  lower  bound.  This  lower  bound 
is  determined  by  the  apriori  probability  of  change  from  one  hypothasis  to  another  in  the  MHSSPRT.  In 
the  presence  of  modeling  errors,  the  AKF  algorithm  has  been  shown  to  coverge  to  the  hypothesis  which 
maximizes  a  particular  information  function,  while  the  MMAE  algorithm  might  show  beta  dominance. 
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Appendix  A 


From  (60).  for  the  MMAE  scheme  : 


4>ki 


Pk+\,i 


Fk+l,i  ^ki 


=  Fk,i+Pi-(1-Fk,i) 

=  0k+l,i  ■  Olk+l,j 

_  4>ki  •  fk+lA‘'‘k+\,i) 

_  <i>ki  •  Pk-\-U  •  QA:-rl,t 

^2j=0  ^kj  ■  Pk+l,j  ■ 

<f>ki  •  (  1  ~  Fki  )0k+l,i  •  ~  ^kj  ‘  Pk+\,j  ‘  l,j  '  Fki 

~  Ej=0  <l>kj  ■  Pk+\,j  ■  «A+1,J 


If  'H.  is  true,  then  we  would  expect  : 


afc+i,i  «  0 

=>  Fk+i^i-Fki  >  0 


V  i  #  i 
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Fk+l,i  -  Pki 


However,  if  for  some  unknown  reason,  cxkj  ~  a  V  j  for  a  prolonged  sequence  of  measurements  : 

(  1  -  P  )  •  Ej/i  Fkj  •  {0k+l,i  —  Pk+l,j}  •  Pki  +  P  •  Zjyti  {^fc+1,1  -  0k+lj)  ‘  Pki 

<Pkj  ■  Pk+lj 

Pki  <  Pkj  7^  h  the  posterior  probability  corresponding  to  the  dominant  P  increases 

irrespective  of  the  true  hypothesis. 


Appendix  B1 

For  the  AKF  algorithm,  there  is  only  one  residual  process.  Hence,  if  for  .some  reason,  akj  ^  a  V  j  for 
a  prolonged  sequence  of  measurements  : 

^  ’”1+1  •  [  ^k+l,i  -  Sk+lj  ]  ■  Tk+l  =  0 

^  Sk+l,i  —  Sk+l,j  V  k 

This  violates  the  identifiability  assumption  of  the  family  .  Moreover,  since  now  Pki  =  Pkj  3i  one  P 
cannot  dominate  over  the  other. 


Appendix  B2 

From  (62)  ; 

Jjiik)  =  E[\n{fkj(rk)} /-Hi,nk-i] 


Let  Hi  be  true.  Then  : 

Jji  {k)-  Jn  (  k  ) 

= 

Since  In  x 

< 

X  —  1 

Jji  {k)-  Ju  (  k  ) 

< 

< 

A 

4  -  1 

From  (41),  (42)  and  (44)  : 


Y  Ii--/vn  _  f  ^  _  1 

h  =  - ,  -T/2  *  \  ^  / 


wherein  Oki  =■  '  [  ^ki  +  {^kj  “  Ki)~^  ]"^  '  hi 

=  bki  ■  Bki  ■  hi 


^k^  =  ^k^  +  ^kj  ~  ^ki^ 

>  0  for  the  integral  to  exist 


'i  j  ^  i 
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Now  : 


Either  <  A^i  <  Ski  or  Akj  >  Aki  >  Ski  V  k 

Clearly,  the  integral  always  exists  and  At  >  0 .  Since  the  bias  terms  are  small,  we  neglect  the  exponential 
term  aki  ^  however,  we  do  note  that  if 

^kj  ^ki  ^  Bki  >  0 

^ki  ^1^  ^  ^ki 

r  -1  1  , 

=►  exp  {  —  •“*»)  <  1 

Hence  in  certain  cases,  the  bias  terms  need  not  be  small.  Anyway,  removing  the  exponential  term  from 
Ik  ,  we  can  show  that  : 

h  —  II  ^ki  ■  +  ^ki  ■  —  ^ki  '  ^kj  '  ^ki  ‘  II 

<  1 

with  the  equality  sign  if  and  only  if  Aki  =  Akj  or  Aki  =  Sk  V  k.  The  former  situation  violates  the 
identifiability  assumption  while  the  latter  assumes  that  Mk  =  Mk  V  k  ,  in  which  case  the  algorithm  has 
already  converged.  Therefore  : 

Jji(k)-Jii{k)  <  0 

Now,  the  equality  sign  in  holds  good  if  and  only  if  fki{-)  =  fkj(-)  almost  everywhere.  Since  we  assumed 
the  family  T  to  be  identifiable,  the  Jji{k)  is  strictly  less  than  Ju  {k)  V  j  and  V  k. 

Appendix  B3 

The  proof  follows  the  analysis  in  [7].  Let  Hi  0  ©  be  true.  We  first  prove  that  whenever 

Jji  {k)  <  Jmiik)  Vfc: 

E  [  {  fr  \y  /  Hi ,  Tlk-i  ]  =  pjmi{T^k-i)  <  1  for  some  t  G  (0, 1)  V  k 

fkm  ) 

By  definition  : 
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Using  the  Lebesgue  dominated  convergence  theorem,  the  limit  and  expectation  may  be  interchang 

Therefore,  for  any  S  £  0, 1),  there  exists  a  ^  €  (0, 1)  such  that  :  ^ 

o 

lim  <  [jji{k)-Jmi{k)]-{i-S)  S 

Jkm  \  ^k  ) 

<  l  +  t-{l-S)-[Jjiik)-Jmiik)]  g 

Jkm  [  rk ) 

<  1 

The  same  analysis  can  be  carried  out  V  k.  In  other  words,  for  any  realization  of  IZk  : 

Pjmi  ( '^k  —  1  )  ^  1  V  A: 

^  pjmi  “  max  pjmi  (  1  ) 

k 

<  1 
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